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RECEIVED 
CENTRAL FAX CENTER 

JUN 0 7 2007 

REMARKS 

In response to the above-identified Final Office Action and Advisory Action, Applicants 
traverse Examiner's rejeciion lo the claims and seek reconsideration thereof. In the instant 
response, claims 1, 5> and 9 have been amended. No claims are added or cancelled. Thus, 
claims 1-15 are now pending in the present application. 

The instant application is directed to generating and consuming a three-dimensional 
audio scene with a soimd source whose spatiality is extended. The present invention enhances 
the sense of reality of a three-dimensional- audio scene, at least by providing a sound source of 
more than one dimension. 

L Claim Amendments 

Applicants amend claims 1 , 5, and 9 to include ^'composing the audio scene*' to clarify 
the reason for generatmg a sound object. 

Applicants amend claims 1 , 5, and 9 also to include 'Svherein the sound source 
characteristics information includes spaiiiUity extension information of the sound source, said 
gpatiality extension informarion enabling the sound source to include more than one dimension, 
and includes the size and shape of ihe sound source expressed in a three-dimensional space'' 
(emphasis added). Support for these amendments may be found in the clean copy of the 
currently amended specification at p. 2, lines 29-31. 

IL Objection to the Specification 

Examiner objects to the specification under 35 U.S.C. 132(a) for allegedly introducing 
new matter into the disclosure. As requested, Applicants cancel the amendments on pages 2-8 of 
the substitute specification witli markings, previously submitted in response to the OfEce Action 
of 8/1 0/06. Submined herev^th are clean and marked-up copies of the substitute specification. 
Accordingly, reconsideration and withdrawal of the objections to the specification are requested. 
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III. glflim Rejections -3S MM.C. SI 02 

In the ouistanding Action, claims 1-15 stand rejected imder 35 U.S.C. 102(b) as allegedly 
being anticipated by U.S. Patent No. 6,330,486 issued to Padula (hereinafter " Padula' '\ 
Applicants respectfully traverse the rejections for at least the following reasons. 

To establish anticipation. Examiner must show that the cited reference teaches each of the 
elements of a claim. 

Claim 1 , as amended^ teaches *'wherein ihe sound source characteristics information 
includes spatiality extension information, said spatiality extension information enabling the 
sound source to include more than one dimension> and includes the size and shape of the sound 
source expressed in a three-dimensional space " (emphasis added). Applicants do not believe that 
Padula teaches these elements of claim L Examiner cites Padula. at col. 3, lines 46-49 to teach 
these elements. The paragraph in which the language is cited describes a virtual world data 
structure and provides examples of object characteristics including "shape, size and location,** 
Padula. at col 3, lines 41-47. The reference provides a Sound node as an example of an object, 
and furtlier states that *the location of an object is lypically defined by its coordinates in a right- 
handed, three-dimensional Cartesian coordinate system/* Id at col. 3, lines 49-52 (emphasis 
added). However, the reference ctnn^ins no mention of any size or shape expressed in tliree 
dimensions. 

Further, Padula does not disclose "sound source characteristics mformation includ[ing] 
, . . spatialiiv extension informaiion enabling the soimd source to incl ude more than one 
dimension '' as recited in claim 1 (emphasis added). Examiner states with regard to Applicant*? 
response to the office action of 2/8/07 that '^Applicant's argument for the definition of 'spatiality 

extension* is not persuasive The defuiition provided by applicant has not been positively 

state[dl in the claim, therefore, it will not be considered.'* (Advisory Action, p. 2.) Applicants 
have accordingly amended the claim to define spatiality extension information as "spatiality 
information enabling the sound source to be of more ±an one dimension." 

Examiner has not relied upon and Applicants have been unable to discern any part of 
Padula for teaching these elements of the claim. The reference merely discloses a method for 
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' "positionfinpl audio sources in a tl\ree dimensional virtual environment such tliai the visually 
perceived and aurally perceived locations of ihc audio source correspond/' Id at col. 3, lines 24- 
27 (emphasis added). This is performed by determining the "difference between the virtual 
camera field of view and the user's field of view" so as to then provide warped location.^ of audio 
sources to a rendering machine, id at coL 3, lines 67 to col. 4, lines 1-6, and not by extending the 
spaiiality of the actual sound source, Evea if the positioning of audio sources a3 disclosed by the 
reference, for example, were to efl ect sound that is perceived as three-dimensional, tlie reference 
does not disclose that the source of that sound is enabled to be spatially extended, or of more 
than one dimension. A spatially extended sound source would also include a size and shape of 
more ihan one dimension, which is also not disclosed in the reference. 

Thus, claim 1 is not anticipated by the cited reference. Accordingly, reconsideration and 
withdrawal of the anticipation rejection of this claim are requested. 

Independent claims 5 and 9, as amended, include elements similar to those in claim L 
For at least reasons analogous to those described above as to claim 1, claims 5 and 9 are not 
anticipated by gadula . Accordingly, reconsideration and withdrawal of the anticipation rejection 
of these claims are requested. 

In regard to claims 2-4, 6-8, and 10-15, these claims depend from independeni claims 1, 
5, and 9, respectively, and incorporate the limitations thereof. Thus, at least for the reasons 
mentioned above in regard to die independent claim, these claims are not anticipated by the cited 
reference. Accordingly, reconsideration and withdrawal of the anticipation rejection of these 
claims are requested. 
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CENTRAL FAX CENTER 



JUN 0 7 2007 



CONCLUSION 



In view of the foregoing, il is believed that all claiitis now pending are in condition for 
allowance and such action is earaesdy solicited at the earliest possible date. If there are any 
additional fees due in connection with the filing of ihis response, please charge those fees to our 
Deposit Account No. 02-2666. Questions regm-ding this matter should be directed to the 
undersigned at (310) 207-3800. 



Respectfully submitted. 



Dated:. 




1279 Oakmoad Parkway 
Sunnyvale, California 94085-4040 
(310) 207-3800 



CEI^TTFTCATE OF FAC^^IMILE TRANSMISSION; 
I hereby certify that this correspondence is being Transmitted via 
facsimile to 571-273-53^)0 addressed id: Mail Stop RCE, 
Commissioner i&r PaterAs, P.O. Box 1430, Alexandria, VA 
22313-1450./ / / , , 





Suzanif^ jSh/iston 7r Date 
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3SIETH0D FOR PROCESSING 3-D AUDIO SCENE WITH EXTENDED- 
SPATIALITY OF SOUND SOURCE 

Description 

5 Technical Field 

The present invention relates to a method for 
processing a three-dimensional audio scene having sound 
source whose spatiality is extended; and^ more particularly, 
10 to a method for processing a three-dimensional audio scene 
to extend the spatiality o£ sound source in a three- 
dimensional audio scene. 

Background Art 

15 

Generally, a content providing server encodes 
contents in a predetermined encoding method and transmits 
the encoded contents to content consuming terminals that 
consume the contents. The content consviming terminals 
20 decode the contents in a predetermined decoding method and 
output the transmitted contents. 

Accordingly, the content providing server includes an 
encoding unit for encoding the ^contents and a transmission 
unit for transmitting the encoded contents, On the other 
25 hand, the content consuming terminals includes a reception 
unit for receiving the transmitted encoded contents, a 
decoding unit for decoding the encoded contents, and an 
output unit for outputting the decoded contents to users. 

Many encoding/decoding methods of audio/video signals 
30 are known so far. Among them, an encoding/decoding method 
based on Moving Picture Experts Group 4 (MPEG-4) is widely 
used these days. is a technical standard for data 

compression and restoration technology defined by the MPEG 
to transmit moving pictures at a low transmission rate. 
35 M - PfiG 4, which i - e— ISO/IBC 14 AO 6 1, — dcf inca tcohnologiK^ 

1 
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— G oding / dec oding — a«^ie — vioual (AV) oaono — i-n- tcrmD of 
audio vioual information and aaooaiatQd acenc dGocriptioja 

informatioii* — ontitiy — fefeartr — GompoacD and — eeR^^-r — ^ 

ge ceivco - and g>rcDQnta — ouch a eodcd jcQprooontanion o£ an 
5 a ' Udio viQual occno io geacrlQal - ly rcfarrcd to ao an ^audio 
v-L.euai — terminal^ — or juor — terminal ^> — 5fei-s — terminal may 
corrcQpond to a atond alono applioQfri - on or bo part of an 
application oyotcm. 

According to MPEG-4, an object of an arbitrar y shape 
10 can be encoded and the content consxming termin als congume 

a scene composed of a plurality of objects. Therefore. 

MPEG-4 defines Audio Binary Format for Scene (Audio BIFS) 
with a scene description lancruage for desi gnating a sovind 
object expression method and the character igtics thereof^ 
15 rniG MPEG 1 dGocribco a oyDtcm for oommuniQating gudi -e 

vioual informs t^n, that io, the rcprGQGntation o£ phycioal 
€>r - virtLfcg - l - obje e to that can bo manifQQt - ed- audibly and/oy 

vioually-: At tha cnGodcr oido^ — audio vioual information 

related to a phyQioal oaQnG io GomproooGd; error protoctod 
20 if nGCQaQaary and roultiplGJCod in ono or m u re codod binary 

e faccomo that arc Lranomittod- A rt - the dcoodor oido^ thooo 

& tercamD aro domultiplojcodj — error Gorreotcd, dooon^roooodr 

Gompooitcd, — asd: — prcpcntod — fee — tho ond — uoGr> Sfeta — 

rcvoalod in t^Coding - Of Moving rioturop i^ind - ^iudio" . ICQ/IDG 
26 M!C1/ ccao /wail m 483 . — Sy atcmo - Working Draft verolon 2.0, 
Novombcr 22 1 190 6^ 

According to MPEG 4, an objoet of an grbiurary ohapo 
pan bo onGoded and the Gontcnt oonouming torminalp oonguTn p 

a Dccnc GompooGd of a pluraliLy of objcoto: Thcrcforor 

30 MrCG 4 dclxnco Audio Binary Format for Sceno (Audio DIFQ) 
%jith a OGOnc dcpcgiption languago foar dcpignating a oouncl 
object Gj^roM^ion roathod and the charaotDriotiaD - thorGof > 

Dcono dopoription moana information — fei^afe — de scriboo 
■tehe ppatio temporal pooitioning of audio vipual objccto^ 
35 and thio io rovoalod in ^Coding Of Moviuy ricturcp An ri 

2 
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Audio;ff lOO/IEC JTC1/0C2D/WQ11 NlOOli ^oJtt for CD-X 4 4Q6 - 1 
DyotcmQi Novombor 21. 1007. — MPEG 4, which io-ICO/IEC 14406. 
addrcooQO — fe5fte — ooaing — &^ — a ^ - dio - - victual — objcoto — of variouo 
typooi - natural" video — and— oudio objcctoj — and alpo aynthctlo 

6 muaic and pound - e^Secta . — To rcGogiBtruct a multimedia pco Re 
at the tcrmina - l ry — i - t- io hcnQQ not - suf fioicnt to tranomit the 
audio viauQl data to a rccolving faorminal^ Addi - tri - onal 
■information ia needed in ogdcr to combino this audio vioual 
data at tho terminal and ■■ conotruot and - ptfeoent to the end 

10 t b &cr"" a — mQaningfu J: — multimedia — ooonou Thio infonciation, 

galled oQonc doaegiption, dQtorminoo the placement of audio 
vioual objootQ — i» — Gpaoo — — time — eanel — are- — tranomittcd 

togotho g — with — fefee — ^ie eto , The ooono depoription — e»4y 

dcoaribes - the — B CructurQ — &^ — fci*e — ooeno.. — aation — e# 

15 aapQmtoling thooo objgatn in t - h e- oomQ rcprc&entatiQn Dpaco 
4^ — oallod Qompopition . — I'ho — action of - — trano formin g — &feego 
audio vioual objcoto — from a oommon aropreocntatlon opace to 
a ppeeifio — pro d on to t i on do vi o 0-7 — fc- . - o - >v ■ ppoalccro and — a 
viowinof i^indow, ip Qallod rendering- 

20 Eacamploa — — objcoto — — oonvontional — video # 

Qonvon ti onal audi 07 — ^ ro minced audio traelco — which includo 
□pooohi rouoiQ, — o^nnthetic audio ouoh — ao MIDI/ and— the like, 
ftV - objoctp can be hierarchical in a oonoG that they may be 
defined go oompooitGO of other AV objooto, which arc oallod 

25 oub objcoto. — AV objcoto chat arc c ompooitop of oub objoQto 
&^ — called compound AV objccto, — All otshcr AV objccto are 

oallod primitive AV ob - jcoto. fThc top - moar object in feb e 

hierarchy io — ealied — «ie — ^^poenc^.. An AV — Goeno — ire — feb© 

topmoDt AV objoat in tho hiprarchy of compound AV objcoto, 

30 a»el — fefei - e — io — revealed — ift — ^Coding — Oi — Moving — Pioturoc — 9akA 
Audio; lOO/IDQ J(PC1/CC30/WQ11 Nl4 8 3j Cyotsoma Worlcing Draft 

Vera ion 2.0^ — November 32 j — 1P06. JV oct of AV ob - jooto — 

oallod — ©f* — AV — Qocnc — ted — aR — AV — accne — includco — occnc 
dcDcription information — - whl - oh dcf inoo opatiotempotfal 

35 attributeo of the AV objocto -r 



PAGE 13/37 ' RCVD AT &/712007 6:00:03 PM [Eastern Daylight T^^^^ 



Jun-07-07 02:59pin Froia-B S T Z 310 820 5988 T-300 P. 016/039 F-445 



Meanwhile, along with the development in video, users 
want to consume contentB of more lifelike sounds and video 
<5uality. In the MPEG-4 Audio Binary Format for Scene 
J^Audio BIFSJ^, an AudioPX node and a DirectiveSound node are 

5 used to express apatiality of a three-dimensional audio 
scene, A DIFC OQcnQ dosoription- io a Gompact binary format 
rcproocnt - ing a pre dofinod pot of objooto, — fPhc BXPS format 
oontQino - information on tho qttributco of objoctp, — which 
dofino their audio vicual proper tico and the otruoturc of 

10 ^fee — occno — graph which — ooiataino — thcpo objcotoi. ttie — B-IPB 

BOGnG dcacription data io itoolf Gonvcyod to the rGGcivor 
ao an Blomcntary Ctroam. - 

Tho Dl - g - C QGono doooription includca a Qollection of 
. nodoQ wh - ich dcaGribc the occnQ and ito layout. An objoofe 

15 in tho OQcnG io dooGri - bed by ono or mora nodcoj which may 
bo grouped — togGthcr uoing a grouping node. 

An - objQQt may bo complotoly doocribod within the BIF 6 
iiili omatiQn/ or may al o o mqn-iro otrcoming data from one or 
more AV dooodorp^ In the latter gqdc> tho node points out - 

20 an object doocriptor o - r a Unii dcDcriptor which indlQatco 
whioh Elcmontary — GtrQam(o -) — is — (aro) — aooooiatcd with tho 

Pound - nodoD arc used for building audio QQcnop from 
audio QouroGa coded with WDBO 4 coding tools i — Co\and may bo 

25 includod in cithor tho 20 or 3D PGono grapho. Xn a 3D 

occno, tho sound may be opatially prcsGntcd bo apparontly 
originato from a particular 3D dirootiorii aooording to the 

g KPaitiono of tho objoot and tho liatcno - r -^ Tho Bound nodo 

is UQod to attach oound to 3D and 2D oocnQ graphs -; — As with 

30 vioual objQotOi cho audio objcoto roproocnted by this nodo 
hag a position in gpQQG and time, and aro transformed by 
tho spatial and grouping tranoformo of nodco hicraarohically 
abovo them in tho ooQno. — fPho nodoe - bolow the Oound nodosa 

4 
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hQWQVor, GonotitutQ an audio oubtrcc. — aniio oubtrcc io upc A 
zo doocribc — a — particular — audio — object through — fefee — ifti - xing 

daael — procQOPing of — Qcvcri eBA — audio — Qtroamo -: Rather — tha ft 

rGprcocntlng a hicrQ - cchy of — gpatiotciiiporal — tranf ormati e &no , 

5 the nodoo witihin the audio aub^ - rcc - ■ roprooont a oignal flow 
graph tha - t- dcoorlbca how to oroano tho au d - io- objoot — fe?©i« 
tho Bounds ■■■ coded in the AudioSouroo otrQamo -; — q?hat io, oach 

«^ — audio — oubtre e — ROde9/ i.c . , te^ ^ o f touroo , AudioMijc^ 

Audiopwitch^ — AudioFX, Pount^^ — e tc . , — aaccptp one — or - 3cvcral ' 

10 chariTiGlo -o d E — inpuc — Bound, — and doooribco — hew — — tur;^ — thopo 
e hannolQ of input pound into ono or more ohannelo of output 
oound - , — g ho — only oounda prco o ntod in tho audio viouaX ocono 
arc thooc ooundo which aro - thc output of audio nodop t - h -afe 
a9f€ — children — &^ — a — Sound — node, — fehafe — te-? — fehe — '^highQotff 

15 ou - tputo — ifi — the aud - io — oubtroQ . Herein, the — AudioCourco 

def inoD — &e«3%d — input — foa? — a — gocno — aaa^^ — tho JVudioMl^c — mlxoo 
aound> — Tho AudloCwitoh owitchoQ — oound in a QQQnQ«> — and bho 
AudioFX cotnbineo audio obj - ectS ' configured for oound % i ?hioh 
dofinop partioular oovmd. — Olhc Douaad — dcfinop proportioo of 

20 oound, The AudioOpugco — ie uood to add oound to a - ooono^ 

Divoroo - audio coding — fcoolo — aro rovoajLod in tho — ICO/IEC CD 
3 8 1597 , — OJho audio nodoo prooont- in an audio - oubtrQc 
do - not oaoh gGpreacnc a oound to bo preQonted in the ooonot 
Rather I the audio — oubtroo roproaonto a olgnal flo%# graph 

25 which — Gomputco a — ginglo — tooaaibly — multichannel ) — audio 
objoot ba&cd on a oet of audio inpufao iti AudloSottrGO nodoo 
and paramotrio tronoformationa - ," - The only oounda which aro 
pgGocntod to the liptcncr arc thooc whioh aro bhO""output« - 
of thoDO audio Qubtroosy * ao Qornieotcd to Sound node - . -- ^Ifhio 

30 is — revoalod — m — ^Coding — ©i — Moving — ricturoa — — Afcada^ 

1007.11^21, l e Q/I - B G — J«rCl/SCa&/WG jA — WArtK)!, Qte^te — ieas — QQ 

1 44 96 - 1 Pyotemo. In theee nodes, modeling of sound pource is 
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usually depended on point-source • Point-source can be 
described and embodied in a three-dimensional sound space 
easily. 

Actual point- sources, however, tend to have a 

5 dimension more than two, rather than to be a point of 
literal meaning. More important thing here is that the 
Shape of the sound source can be recognized by human beings, 
which is disclosed by J. Baluert, '^Spatial Hearing," the 
MIT Press, Cambridge Mass, 1996. 

10 For example, a sound of waves dashing against the 

coastline stretched in a straight line can be recognized as 
a linear sound source instead of a point sound source. To 
improve the sense of the real of the three-dimensional 
audio scene by using the Audio BIFS, the size and shape of 

15 the sound source should be expressed. Otherwise, the sense 
of the real of a sound object in the three-dimensional 
audio scene would be damaged seriously . 

That is, the spatiality of a sound source could be 
described to endow a three-dimensional audio scene with a 

20 sound source which is of more than one-dimensional. 

Disclosure of Invention 

It is I therefore, an object of the present invention 
25 to provide a method for generating and consumin gp roGOPQing 
a three-dimensional audio scene having a sound source whose 
spatiality is extended by adding sound source 
characteristics information having information on extending 
the spatiality of the sound source to three-dimensional 
30 audio scene description information. 

The other objects and advantages of the present 
invention can be easily recognized by those of ordinary 
skill in the art from the drawings, detailed description 
and claims of the present specification. 

6 
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In accordance with one aspect of the present 
invention, there is provided a method for generating 
proGOPDing — a three-dimensional audio scene with a sound 
source whose spatiality is extended, including the steps 
5 of: a) generating a sound object; and b) generating three- 
dintengjonal audio scene description inforrnation including 
sound source characteristics information for the sound 
object. wherein the sound source characteristics 
information includes spatiality extension information of 
10 the sound source which Is Information on the size and shape 
of the sound source expressed in a three-dimensional 
space, a ) gcnorating 3I> - audio aoono dQDori - |>t - ion information 
including oound — gouroo ^ Qharaotoriotioo — information o£ — a 
TOUHd — obj o c t ; — and b) coding — fehe — o ound ob j o o t and - ■ the — ^ 
cH - tdio— aconc — dcocription — information — including — fehe — oound 
OQuroQ — QharaotorloticO" information of the pound objcot/ 

whoroin the pound oouroo ohanraotjogajptica information 

includoo — opatialifey — QxtonDion — information — — ***e — sound 
Q our QQ - whioh — » — information on the — & - i - ge — and ohapc of the 
oound oourcQ cacprcDOca In a throo dimonoional ppaoQ -r 

In accordance with on oa nothor aspect of the present 
invention, there is provided a method for 
consumin gp ro u coolng a three-dimensional audio scene with a 
sound source whose spatiality is extended, whioh includoo 
■ the Qtopo of - . — including the steps of; a) receiving a sound 
object and three-dimensional audio scene description 
information including sound source characteristics 
information for the sound object; and b) outputting the 
sound object based on the three-dimensional audio scene 
30 description information, wherein the sound source 
characteristics information includes spatiality extension 
information which is information on the size and shape of a 
sound source expressed in a three-dimensional space, » )- 
doooding a oound objoot and — M — audio ooono doooription 

35 information — includin g pound oourcc oharaotoriaticp 

7 



20 



PAGE17/37'RCVDATS/7120076:00:03PM [Eastern Daylight Tim^^ 



Jun-07-07 03:00pfli Frora-B S T Z 



310 820 5988 



T-300 P. 020/039 F-445 



informatioai for the pound object; — and b) — e ^pucting the 
e^saind object — baaod on — fehe — thrcG dimcnoionirf ! — audio oGcno 

dGPcriptlon — — i - nf ormation^ wherein €:^ve -aound &ew?ee 

charaoteriotiGo — a m £ orma t i on inc Inde -e — spatia - l - i - ^ - extcnaion 

5 - information i^hich io information on the oicc and ohapc o£ 
fe l^i<5 pound Douroc Qxprcpocd in a three d - kncnQional ppacc. 

5a — fl HseordanoG — warth — onothor- aopcct — — fefee — prcooxrir 
invontion, — there — is — provided — a — throe dimenoional — audio 
G ccn c dn f n nfrrnm n r'-'^^^ri nnmrr^o whnnn QiQQtialitv io 

10 extended; wbioh — includeo s — -a — oound — object; a»d — thrco 

^agt enaionQl audio oponc deooription information including 
pound — oouroe oharaoteriptioD information — iea? — fe^e — QQ"^^ 

ebg^cfa — data-y wherein tho — &e wid — ooureo — charQGterioti -e & 

information includeo ppatiality cjctenoion information whie i^ 

15 io information on the oigc and ahape of the oound oouroe 
eaqarcoood in a th - :g<sc - dimQnoiona - > opaeo T 

Brief Description of Drawings 

20 The above and other objects and features of the 

present invention will become apparent from the following 
description of the preferred embodiments given in 
conjunction with the accompanying drawings, in which: 

Fig, 1 is a diagram illustrating various shapes of 
25 aound sources; 

Fig. 2 is a diagram describing a method for expressing 
spatial sound source by grouping successive point sound 
sources; 

Pig- 3 shows an example where spatiality extension 
30 information is added to a ••DirectiveSound" node ' of Audio 
BIFS in accordance with the present invention; 

Fig. 4 is a diagram illustrating how a sound source 
is extended in accordance with the present invention; and 

Fig, 5 is a diagram depicting the distributions of 
35 point sound sources baaed on the shapes of various sound 

8 
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CENTRAL FAX CENTER 

sources in accordance with the present invention. jyjj Q 7 2007 

Best Mode for Carrying Out the Invention 

5 Other objects and aspects of the invention will become 

apparent £rom the following description of the embodiments 
with reference to the accompanying drawings, which is set 
forth hereinafter. 

Following description exemplifies only the principles 

10 of the present invention. Even if they are not described 
or illustrated clearly in the present specification, one of 
ordinary skill in the art can embody the principles of the 
present invention and invent various apparatuses within the 
concept and scope of the present invention. 

15 The use of the conditional terms and embodiments 

presented in the present specification are intended only to 
make the concept of the present invention understood , and 
they are not limited to the embodiments and conditions 
mentioned in the specif ication, 

20 In addition, all the detailed description on the 

principles, viewpoints and embodiments and particular 
embodiments of the present invention should be understood 
to include structural and functional equivalents to them. 
The equivalents include not only currently known 

25 equivalents but also those to be developed in future, that 
is, all devices invented to perform the same function, 
regardless of their structures - 

For example, block diagrams of the present invention 
should be understood to show a conceptual viewpoint of an 

30 exemplary circuit that embodies the principles of the 
present invention. Similarly, all the flowcharts, state 
conversion diagrams, pseudo codes and the like can be 
expressed substantially in a computer-readable media, and 
whether or not a computer or a processor is described 

35 distinctively, they should be understood to express various 
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processes operated by a computer or a processor. 

Functions of various devices illustrated in the 
drawings including a functional block expressed as a 
processor or a similar concept can be provided not only by 

5 using hardware dedicated to the functions, but also by 
using hardware capable of running proper software for the 
functions. When a function is provided by a processor, the 
function may be provided by a single dedicated processor, 
single shared processor, or a plurality of individual 

10 processors, part of which can be shared. 

The apparent use of a term, 'processor', * control' or 
similar concept, should not be understood to exclusively 
refer to a piece of hardware capable of running software, 
bun should be understood to include a digital signal 

15 processor (DSP), hardware, and ROM, RAM and non-volatile 
memory for storing software, implicatively . Other known 
and coromonly used hardware itiay be included therein, too. 

In the claims of the present specification, an element 
expressed as a means for performing a function described in 

20 the detailed description is intended to include all methods 
for performing the function including all formats of 
software, such as combinations of circuits for performing 
the intended function, firmware/microcode and the like. To 
perform the intended function, the element is cooperated 

25 with a proper circuit for performing the software. The 
present invention defined by claims includes diverse means 
for performing particular functions, and the means are 
connected with each other in a method requested in the 
claims. Therefore, any means that can provide the function 

30 should be understood to be an equivalent to what is figured 
out from the present specification. 

Other objects and aspects of the invention will become 
apparent from the following description of the embodiments 
with reference to the accompanying drawings, which is set 

35 forth hereinafter ♦ The same reference numeral is given to 
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the same element, although the element appears in different 
drawings. in addition, if further detailed description on 
the related prioK arte is determined to blur the point of 
the present invention, the description is omitted. 

5 Hereafter, preferred embodiments of the present invention 
will be described in detail. 

Fig. 1 ia a diagram illustrating various shapes and 
sizes of sound sources. Referring tO' Pig. 1/ a sound 
Bource can be a point, a line^ a surface and space having a 

10 volume. Since sound source has an arbitrary ahape and size, 
it is very complicated to describe the sound source. 
However, if the shape of the sound source to be modeled is 
controlled, the sound source can be described less 
complicatedly- 

15 In the present invention, it is assumed that point 

sound sources are distributed uniformly in the dimension of 
a virtual sound source in order to model sound sources of 
various shapes and sizes. As a result, the sound sources 
of various shapes and sizes can be eaqpressed as continuous 

20 arrays of point sound sources- Here, the location of each 
point sound source in a virtual object can be calculated 
using a vector location of a sound source which is defined 
in a three-dimensional scene. 

When a spatial sound source is modeled with a 

25 plurality of point sound sources, the spatial sound source 
should be described using a node defined in Audio BIFS- 
When the node defined in Audio BIPS, which will be referred 
to as an AudioFX node, is used, any effect can be included 
in the three-dimensional scene. Therefore, an effect 

30 corresponding to the spatial sound source can be programmed 
through the AudioFX node and inserted to the three- 
dimensional scene. 

However, this requires very complicated Digital 
Signal Prooessing (DSP) algorithm and it is very 

35 troublesome to control the dimension of the spatial sound 

11 
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source. 

Al3o, the point sound sources distributed in a limited 
dimension of an object are grouped using the Audio BIFS, 
and the spatial location and direction of the soimd sources 
5 can be changed by changing the sound source group. First 
of all, the characteristics of the point sound sources are 
described using a plurality of ^DirectiveSound" node. The 
locations of the point sound sources are calculated to be 
distributed on the surface of the object uniformly. 
10 Subsequently, the point sound sources are located with 

a spatial distance that can eliminate spatial aliasing, - 
which is disclosed by A. J, Berkhout, D. de Vrie$, and P- 
vogelr ^Acoustic control by wave field synthesis,'' J. Aoust. 
Soc. Am., Vol. 93, No. 5 on pages from 27 64 to 2778, May, 
15 1993. The spatial sound source can be vectorized by using 
a group node and grouping the point sound sources. 

Pig. 2 is an illustrative diagram depicting a scene of 
Audio BIPS. In the drawing, a virtual successive linear 
sound source is modeled by using three point sound sources 
20 which are distributed uniformly along the axis of the 
linear soiond source. 

The locations of the point sound sources are 
determined to be (xo-dx, yo-dy, zo-dz) r (xo, yo, ^o)/ and 
(xo+dx, yo+dy, zo+dz) according to the concept of the 
25 virtual sound source. Here, dx, dy and dz can be 
calculated from a vector between a listener and the 
location of the soimd source and the angle between the 
direction vectors of the sound source, the vector and the 
angle which are defined in an angle field and a direction 
30 field. 

Pig. 2 describes a spatial sound source by using a 
plurality of point sound sources. Audio BIFS appears it 
can support the description of a particular scene. However, 
this method requires too much unnecessary sound object 
35 definition. This is because many objects should be defined 

12 
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to model one single object. 

When it is told than the genuine object of hybrid 
description of Moving Picture Experts Group 4 (MPEG-4)is 
more object-oriented representations, it is desirable to 

5 combine the point sound sources, which are used for model 
one spatial sound source/ and reproduce one single object. 

In accordance with the present invention, a new field 
is added to a ^DirectiveSound* node of the Audio BIPS to 
describe the shape and si2e attributes of a sound source. 

10 Pig. 3 shows an example where spatiality extension 
information is added to a ^DirectiveSound" node of Audio 
BIPS in accordance with the present invention . 

Referring uo Fig. 3, a new rendering design 
corresponding to a value of a S our ceDimens ions" field is 

IS applied to the ^DirectiveSound" node. The 
"SourceDimensions" field also includes shape information of 
the sound source. If the value of the "SourceDimensions" 
field is ^0,0,0", the sound source becomes one point, no 
additional technology for extending the sound source is 

20 applied to the "DirectiveSound" node. If the value of the 
"SourceDimensions* field is a value other than "0,0,0", the 
dimension of the sound source is extended virtually. 

The location and direction of the sound source are 
defined in a location field and a direction field, 

25 respectively, in the ^^DirectiveSoxmd" node. The dimension 
of the sound source is extended in vertical to a vector 
defined in the direction field based on the value of the 
"SourceDimensions " field . 

The ^location" field defines the geometrical center 

30 of the extended sound source, whereas the 
•^SourceDimensions" field defines the three-dimensional size 
of the sound source. In short, the size of the sound 
source extended spatially is determined according to the 
values of Ax, Ay and Az. 

35 Fig- 4 is a diagram illustrating how a sound source 

13 
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is extended in accordance with the present invention. A& 
illustrated in the drawing, the value of the 
"SourceDimensions" field is (0, Ay, Az) , Ay and Az being 
not zero l^yi^Q, Azt^O) . This indicates a surface soiind 

5 source having an area of AyxAi. 

The illustrated sound source is extended in a 
direction vertical to a vector defined in the ^direction* 
field based on the values of the ^SourceDimensions" field, 
i.e., ( 0 , Ay , A2 ) , and thereby forming a surface sound 

10 source* As shown in the above, whan the dimension and 
location of a sound source is defined, the point sound 
sources are located on the surfaces of the extended sound 
source, in the present invention, the locations of the 
point sound sources are calculated to be distributed on the 

15 surfaces of the extended sound source uniformly. 

Figs. 5A to 5C are diagrams depicting the 
distributions of point sound sources based on the shapes of 
various sound sources in accordance with the present 
invention. The dimension and distance of a sound source 

20 are free variables. So, the size of the sound source that 
can be recognized by a user can be formed freely. 

For example, mult i- track audio signals that are 
recorded by using an array of microphones can be expressed 
by extending point sound sources linearly as shown in Fig, 

26 5A. in this case, the value of the ^SourceDimensions'' 
field is (0, 0, Az) , 

Also, different soxmd signals can be expressed as an 
extension of a point sound source to generate a spread 
sound source. Pigs. 5B and 5C show a surface sound source 

30 expressed through the spread of the point sound source and 
a spatial sound source having a volume. In case of Fig- SB, 
the value of the ^SourceDimensions* field is (0, Ay, Az) 
and. in case of Fig- 5C, the value of the 
^SourceDimensiona" field is (Ax, Ay. Az) . 

35 As the dimension of a spatial sound source is defined 

14 
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as described in the above, the number of the point eound 
sources (i.e., the number .of input audio channels) 
determines the density of the point sound sources in the 
extended sound source. 

5 If an "AudioSource" node is defined in a "source" 

field, the value of a "numChan" field may indicate the 
nuxnber of used point sound sources. The directivity 
defined in ^angle^" "directivity" and "frequency* fields of 
the ^DirectiveSound" node can be applied to all point sound 

to sources included in the extended sound source uniformly. 

The apparatus and method of the present invention can 
produce more effective three-dimensional sounds by 
extending the spatialiny of so\md sources of contents . 

While the present invention has been described with 

15 respect to certain preferred embodiments, it will be 
apparent to those skilled in the art that various changes 
and modificationsj may be made without departing from the 
scope of the invention as defined in the following claims. 
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